Llama 3.3 Swallow 70B Instruct V0.4
Llama 3.3 Swallow is a large language model (70B) based on continuous pre-training of the Meta Llama 3.3 model, enhancing Japanese capabilities while retaining original English proficiency.
Large Language Model
Transformers Supports Multiple Languages